SEQUENCE LISTING

(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: Fred Hutchinson Cancer Research Center, Inc. 

(B) STREET: 1100 Fairview Avenue North, Mailstop C2M-027

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 98109

(A) NAME: Thomas Spies 

(B) STREET: 2429 E. Aloha

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA

(F) POSTAL CODE (ZIP): 98112

(A) NAME: Veronika Spies 

(B) STREET: 2429 E. Aloha 

(C) CITY: Seattle 

(D) STATE: Washington

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 98112

(ii) TITLE OF INVENTION: CELL STRESS REGULATED HUMAN MHC CLASS I GENE

(iii) NUMBER OF SEQUENCES: 16

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(vi) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 60/029,044

(B) FILING DATE: 29-OCT-1996

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CACTGCTTGA GCCGCTGAGA GGGTGGCGAC GTCGGGGCCA TGGGGCTGGG CCCGGTCTTC 60

CTGCTTCTGG CTGGCATCTT CCCTTTTGCA CCTCCGGGAG CTGCTGCTGG TGAGTGGCGT 120

TCCTGGCGGT CCTCGGCGGA GCGGGAGCAG TGGGACGTTT CCGGGGGTCG GGTGGGTAGC 180

GGCGAGCGCT GTGCGGTCAG GGCGGGGCTC CTGTGCCCTG TCGGTGGCGC AGGGAGCTGG 240

ACGCGGCCCG TTACCGCCAC ACTTCAGCCC TGCTTCCCCG TCACTTTTCA GTCCTCCTCG 300

GGATCGCGCA TCACCTGCAC TTTCTGGTCT CCTCCTGCTC TTTCTCTCCT CGCGTCTCCT 360

CCGCTTCCTC TCACTTTTCG GACAAACCAG TCCTTCTGAG GCCCATGGGT TCCCGGGCTG 420

CCTCCGGGGC TGCTCCTGTG AATGGCATTC GAGTGCCCTT CCAGCGCGGC CACTGAAGCA 480

GCCACAACCC CCGGTGCTCG GGGCGGCTCT CAGGTCCCTG AAGTCCTGTC CTCTCCCGGA 540

GCCGACGTGT TCTCAGCTCC TGGGCCGCAG CTCCTGGAGT AGGGGCCCTC CTTTCTCGGG 600

ACCCGGAGCT GGTGCTTCCT GCTGCTGTGG GGACTGTGGG GGGTCCTGAC TCTCAAGCTG 660

AGGGGTTGGA GTCTGCAGGC TCCGGGCAGA GGATTCTTCC TGCGACTTCT CTCATCCCCA 720

GCTCATTCTC CCCTCGCCTC TGGCTCCGAG GGTCCTCTCC TCTCTCTCAT CCCACCCCTA 780

CTAATGACCA GTGATCTAAG GACACCAGAT TCCCTCTCAC CTCCTCCCTG CCCATCTCAG 840

GGCCCGCTGA GTCCTTTTGC CCTCCCAGCT CCCTGCTACC CCTTCCTGTG TGCTGTTCTC 900

TGATCCATTT CTAGGGTGTC CTCTGCCCTC ATCCCCTGTC CCCGCCACCG AAGTCCCTCC 960

TGCACCCCTT ATGGGCCTTT CCTACAAGCA GCCTTCACCC AGTGCTGCCC CTATGCCTCC 1020

CCGTTCCCAA ATGTCCCTGA CTCTAACTTT CTGGTGCTGC CTTTTATCCG GGGGGGTCTT 1080

CCCTCCATCC CACTCCCCTC CAGACCCCCA AGGGGAACCC TGATGCTAAT GGCAGTTGGG 1140

CCTTAGGCAG GGCGCAGGGC AGCGCAGATG CCCCCTCCCC TCCAGTGCAG ATGCCTGTTC 1200



TGGACCCTGC CTCATTGTGG CCCCTTCCCC ACTCCTTCAT CCTCAGCCTC ACCCTCTTGA 1260

GGACCCCACC CTCCAGCCCA CAGGTGCTGG ACCATCCCTC CCTGGTCCCT CCGCCCCTCT 1320

CCACCTTGGG ACCTTGTGCT GCTCCTATCT CTTGCCCAGC TGCCTTGGGC CCTCAGCACG 1380

TTCTCATCTT TCAGTGGGAA AGTGGGAGTG CTGGAGCATA TGACAGTGCT GAGCATCTTT 1440

CCCAAGCCCC ACCCTCCCCC AGAGCACCCT CCCCTCCTGT CCTCACCCTA CCCCAAGTTC 1500

TCCCACAGTC ACTCCTGCCC CATGCTCATG CCGCCCTCCA GTTCTTGCTC TGCCCATCTC 1560

CCCTCCCCAA CCCAGACCTA AAACAGGCTG TTGGGCCAAC TGTTCCTTGA CCTTCCTTCT 1620

TTTCTTTTGG TTCCTTGACC CCAGTGGGCT CTCACTCCCC ACACCGCATA TCTAAAATCT 1680

GTTTTGCCTG CTCTTGGGGT GCCACTGCTC CCCCTCCAGC ATTACTCCTT TTGGCAGGTC 1740

CTTCCTCAGG CTGAGAATCT CCCCCTCTAC CTTGGTTTTC TCTCTCTGGC CAGCACCCCC 1800

ACTCCTTGCT TTGTTTTTAA TTTTTAACTT TTGTTTGGGT ACGTAGTAGA TATATATGTA 1860

TATATTTATG GGGTACATGG GATATTTTGA CACAGGCCTA CAATATGTAA TAATCACATC 1920

AGGGTAAATG GGTTATATCA CAACAAGCAT TTATCCTTTC TTTGTGCTAC AAACAATCCC 1980



ATTATGCTCT TTCAGTTATT TTTAAATGTA CAATAAATTA TTGTTGACTG TACTCACCCT 2040

GCTGTGCTAT CTACTAGATC TTATTCATTC TAATTATATT TTTGTACCCA TTATTAACCA 2100

TCCCTGCTCC CCCACTCCCC ACTACCCTTC TCAGCCTCTG GTAATCATCA TTCTATTGTC 2160

TCTCCCCATG AGGTCCATTG TTTTAAATTT TGGCTGCCAC AAATAAGTGA GAACATGCAA 2220

AGTTTGTCTG TCTGGGCCTG GGGCTTATTT CACTTCACAG GATGACCTCC AGTTCTTTGC 2280

AAATGACACG ATGGCTGAAT AGTTCTCCAC ATACACATGT ACACCACATT TTCTTTATCC 2340

ATGCGTCTGT TGATGGACAC TTAGATTGCT TGCAGATCTT GGCTACTTTG AATAGTGCTG 2400

CAATAAACAT GGAAAAGTAG ATAGCTCTTT AATATACCGA TTTCCTTTCT TTGGAGTATA 2460

TGCCTAACAG TGGGAGTGCT GGAGCATATG ACAGCTCTAT TGTATTTTTA GTTTTTGGAA 2520

GAACCTCCAC ATTGTTTCCC ATAGTGGTTG TACTAGTTTA CGTTCCCACC AACAGTGTAC 2580

ATCCTCACCA GCATTCCTTA TTTCTACATC CTCGCCAGCA TTCCTTATTG CCTGTCTTCT 2640

GGATAAAAGC CAGTTTATCT GGGGTGGGAT GTTATCTCGT AGGAGTTTTG ATTTGCCTTC 2700

ATCTGTTGAC GAATGATGTT GAGCACCTTT TCATATACCT GTTTGCCATT TATATGTCTT 2760

CTTTTGAGAA ATGACTATTC AGATCTTTTC TCATTTTTAA ATTGGATTAT TATATTTTTT 2820

TTCCTATAGT TGTTCGAGCT CCTTATATGT TTCAGTTACT GATCCTTTGT CAGATGAATA 2880

GTTTGAAAAT ATTTTCTCCC ATTCTTGGAT GGTCTCTTCA TTTTGTTTAT TGTTTCCTTT 2940

GCTGTGCAGA AGCCTTTTTA CTTGATATGA TCCCATTTAT GCAATTTTAC TTTGGTTACC 3000

TGTGCTTGTG GGGTATTACT TTAAAAATCT TTGCCCAGTC CAATATCCTA GAGAGTTTCC 3060

CCAATGTTTT CTTGTATAGT TTCATAGTTT GAGGTCATAG ATTTACATCT TTAATCCACT 3120

TTGATTTGAT TTTTGTATAT GGTGAAAGAC AGGGTCTAGT TTCATTCTTC TGCATAAGGA 3180

TATCTAGTTT CCCCAGCACC ATTTTTGAAG AGACTCTCCT TTGCCAATGT GTGTTCTTGG 3240

TACCTTTGTT GGAAATGAGT TTACTGTAGA TGTATGGAAT TGTTTCTGGG TTCTCTATTC 3300

TGTTTCATTG GTCTGTGTGT CTGTTTTTAT GCCAGTATCA TGCTGTTTTG GTTACTGTAG 3360

CTCTGTAGTA TAATTTGAAG TCAGATAATG TGATTCCTCT AGTTTTGTTC ATTTTGCTCA 3420

GGATAGCTTT ATCTATTCTG GTTTTTTTGT GGTTCCATAT GCATTTTAGG ATTATTTTTA 3480

TTATTTCTGT GAAGAATGTC ATTAGTGTTT TGATAGGGAT TGCATTGAAT CTGTAGATTA 3540

CTTTGGGTAG TATGGATATT TCAACAAAAC TGATTCTTCC AATCCATGAA CGTGGACTAT 3600

CTTTTCCATT TTTTGTGTCC TTCAATTTTT TGCATCAGTG TTTTTTGTTT TTGGTTTTTG 3660

AGATGGAGTT TCACTCTTGT TGCCCAGGCT AGAATGCAAG GGTGTGATCT TGGCTCACCG 3720



CAACCTCCGC CTCCCAGGTT CAAGCTATTC TTCTGCCTCA GCCTCCCAAG TAGCTGGGAT 3780

TACAGGCATG TGCCACTGTG CCTGGCTAAT TTTCTATTTT TATTAGAGAT GGGGTTTCTC 3840

TATGTTGGCC AGGCTAGTCT TGAACTCCTG ACCTCAGGTG ATCCACCTGC CTCGGCCTCC 3900

CAAAGTGCTG GGATTACAGG CATGAGCCAC CACGCCCAGC CACATCACTG TTTTATAGTT 3960

TTTATTGGAG AGGTCTTTCA CTTCTTCAGT TAGGTTTATT CCTCAGTATT TTATTTTATT 4020

TGTAGCTATT GTAAATGGGA TTCGTTTCTT GATTTCTTTT TCAGATTATT TGCTGTTAGC 4080

ACTGATTTTT GCATGTTGAT TTTGTATCCT GCAACTTTAC TGAATTTGTT CTTCAGTTCT 4140

AATGGTTTTT TGGTGGAGTC TTTAGGTTTT TCCAAATATC AGACCACATG ATCTGCAAAC 4200

AAGGATAATT TGACTTCTTC TTTTCCAGTT TTAATGCCCT TTCTTTCTTT CTCCTGTCTG 4260

ATTGCTCTAG TTAGGATCTG CAGTACTGTG TTGCATAACT GTGGTAAAAT TAGTCATCCT 4320

TGTCTTATTC CAGATCTTAG AGAAAAGGCT TTCAGTTTTC CCCCATTCAG TATGTTACTA 4380

GCTGTGAGTT TGTCATATAT GGCTTTTATT ATATTGAGGT CTGTTCCTTG TATACTTAGT 4440

TTTTTGAGAG TTTTTATCAT GAAGGGATGT TGAATTTATC AAATGCTTTT TCAGTATCAA 4500

TTGAATGATA CTGGCTTTTG TCCTTTATTC TGTTGATATG ACGTATTACA TTGATTGATT 4560

TGTGTATGTT AAATCATCCT TGCATACCTG GAATACATTC CACTTGCTCA TAAAGAATGA 4620

TCTTTTTTAA TGTATTGTTG AATGTGGTTT GCTAGTATTT CCTTGACGAT TTTTGCATCG 4680

GTGTTCATCA GGGATATAGG CCTGTAGTTT TCTTTTTTAT GATGTGTCTT TGCCTGGTTT 4740

TTGTATCAGG ATATTCCTGG CTTTGTAAAA TGAGTTTGGA AGTATTCCCT CCTCCTCTAT 4800

TTTTCAGAAC AGTTTGAATA GGACTGACAT ATGTTGTTCT TTAAAAGTTT AATTGTGGTA 4860

AATTATACAT TACATAAATT TTACTGTTTT AACCACTTTT AAGTGTATAC TCGGTGGCAT 4920

TAGATACATT CACATTTTTG TGCAACCCAA AACTCTGTGC CCATTAATCG GTAACTCCCC 4980

ATTCCTCCCT ACCTCTGGCC CCTGGTAACC ACCATTCTAC TTTTTGTTTC TATGAATTTG 5040

ACCACTCTAG GTACCTCATT TAAGCAGAAT CATGTAATGT TTGTCTTTTT GTTTCTGGCT 5100

TATTTCACTT ATAATATTTT TGAGGTTCGG TGGGCACAGT GGCTCACGCC TGGATTTCCA 5160

GCACTTTGGG AGGCTGAAGC AGGTGGATCA CCTGAGTTTC GGAGTTCGAA ACCAGCCTGG 5220

CCAACATGGT GAAACCCCAT CTCTACTAAA AATAATAAAA GTTAGCCGGG CGTGATGGCG 5280

GGTGCCTGTA ATCCCAACTA CTTGGGAGGC TGAGGCAGGA GAATCGCTTG AATCCGGGAA 5340

GTGGAGGTTG CAGTGAGCTG AGATCAGGCC ACTGCACTCC AGCCTGGGCA ACAAGAGTGA 5400

AATTCCATCT CCAAAAAAAA AAAATAAAAC AATAATAATA ATAATATTTT TGAGGTTCAT 5460

CCAAGTTGTA GTATGGGTCA GAATTTCATT CCTTTTAAGG ATGGATAATA CTCATTATAT 5520

GTATGTACCA CATCTTGGTT ATCCATCCCT CAGACAATGG ACACTTGGGT TACTTCTACC 5580

TTTTGGATAT TGGCAAATAT TTCATTTCCT TTGGGTATAT ATTTATTTCC TTTGGGTATT 5640

TCTTTTGGGT ATATATCCAG AAATAGAAGC AGTACACAGG GGCTTCATTT TCTCTGTCTC 5700

TTTGCCAACC TTGCTCTGTG TGTGTGTGTA TGTGTGTGTG TAGGTGTGTG ATAACAGCCA 5760

TCCTGATTGG TTTCAGGTGG CATCTCATTG TGGTTTGGAT TTGCATTTTC CTAATGAGTG 5820

CTGATATTGA GCATCTTTTC ATGTGTTTGT TGATCATTTG TAATTTTCTT TGAAGAATTG 5880

GCCATTTAAG TCTTTTGCCC ATTTTTTCCC CCACATAGCT TCTCTTATCA GATATATGAC 5940

TTGCAATATT TATTTCATTT CGGGGTTGAT TGCTTTTTCA CTCTGATTGT GCCCTTTGAT 6000

GCATAGATGT TTTGAATTTT CATCAGTCTA CTTTGTCAGT TCTTTCTATT CTATCTGTGC 6060

TTTGGTGTCA TATCCATGAA AGCACTGTCA AATCCTATGT CATGAACATT ATCCCCAATG 6120

TTTGCTTCTA AGAAATTTTT AGGTTTTAGT TCTTGAGTGT AGAGTTTAGG TCTTTGATTC 6180

ATTTTGAGTT AATTTTTGTA TATAGTGCAA ATTAAGGGTC CAATTTTATT TTAACACCCC 6240

CTGCCCCCAG AACTATTTGC TGAAAAGATC AACTGACTCT TTGTCACCTG CTCACCCCAG 6300

TGGACACTAG CTGTTCCATC CAATTGCTGT CCTGGGGCCT TGTCATGCTA CTCTTCCACT 6360

TTGAACCCAA GCCCACACCG TTCGTTGCTC CCCTCTGGGA TACTGACCCC ACTATAAACT 6420

TCTCTGGGGC TACAACCTTC CTACCCTTTG TGCCTCATGA CCACCCCCTC CCTTGTCCCC 6480

GCCATGCCCA TGATGAGTCT CTTCTCGAGG CAGCTCCCCT TGCCTCCATC TCACCCTCAG 6540

CCTATGCACC ACAGCCACAC TGGACATGGG TCCCTCTGAG CCTGAGTCCC TTCCCATTCC 6600

CACCATCTCC TCTGGCAAGA CCTTCCTTCC ACCACCTTCA TGCTCCTCCC TTGCCCCTGC 6660

AGGGCAGCCT CTCCCCTTGG CCCCTATTCC CTTAGGGGGC TTGTGGCCAC CCAGTCCTTG 6720

CACCTGGCCT ACAAGTTTGC CATCTTCATT CCCCCTTCTT CTGTTCATCA GCCCCCTCCT 6780

CTATCCTCCC ACCCTCACAG TTTTCTTTGT ATATGAAATC CTCGTTCTTG TCCCTTTGCC 6840

CGTGTGCATT TCCTGCCCCA GGAAGGTTGG GACAGCAGAC CTGTGTGTTA AACATCAATG 6900

TGAAGTTACT TCCAGGAAGA AGTTTCACCT GTGATTTCCT CTTCCCCAGA GCCCCACAGT 6960

CTTCGTTATA ACCTCACGGT GCTGTCCTGG GATGGATCTG TGCAGTCAGG GTTTCTTGCT 7020

GAGGTACATC TGGATGGTCA GCCCTTCCTG CGCTATGACA GGCAGAAATG CAGGGCAAAG 7080

CCCCAGGGAC AGTGGGCAGA AGATGTCCTG GGAAATAAGA CATGGGACAG AGAGACCAGG 7140

GACTTGACAG GGAACGGAAA GGACCTCAGG ATGACCCTGG CTCATATCAA GGACCAGAAA 7200

GAAGGTGAGA GTCGGCAGGG GCAAGAGTGA CTGGAGAGGC CTTTTCCAGA AAAGTTAGGG 7260

GCAGAGAGCA GGGACCTGTC TCTTCCCACT GGATCTGGCT CAGGCTGGGG GTGAGGAATG 7320

GGGGTCAGTG GAACTCAGCA GGGAGGTGAG CCGGCACTCA GCCCACACAG GGAGGCATGG 7380

GGGAGGGCCA GGGAGGCGTA CCCCCTGGGC TGAGTTCCTC ACTTGGGTGG AAAGGTGATG 7440

GGTTCGGGAA TGGAGAAGTC ACTGCTGGGT GGGGGCAGGC TTGCATTCCC TCCAGGAGAT 7500

TAGGGTCTGT GAGATCCATG AAGACAACAG CACCAGGAGC TCCCAGCATT TCTACTACGA 7560

TGGGGAGCTC TTCCTCTCCC AAAACGTGGA GACTGAGGAA TGGACAGTGC CCCAGTCCTC 7620

CAGAGCTCAG ACCTTGGCCA TGAACGTCAG GAATTTCTTG AAGGAAGATG CCATGAAGAC 7680

CAAGACACAC TATCACGCTA TGCATGCAGA CTGCCTGCAG GAACTACGGC GATATCTAGA 7740

ATCCAGCGTA GTCCTGAGGA GAAGAGGTAC GGACGCTGGC CAGGGGCTCT CCTCTCCCTC 7800

CAATTCTGCT AGAGTTGCCT CACCTCCAAG ATGTGTCCAG GGAAACCCTC CCTGTGCTAT 7860

GGATGAAGGC ATTTCCTGTT GGCACATCGT GTCCTGATTT TCCTCTATTG TTAGAGCCAC 7920

TGGATAAAGA CAGTGGGTCA GGGACTGGAC CATCCAGTGT TGTAATCAGG GCAAGTAGAG 7980

GACCCTCCGA CAGAATCCTG AGCCTGTGGT GGGTGTCAGG CAGGAGAGGA AGCCTTCAGG 8040

GCCAGGGCTG CCCCCTCTGC CTCCCAGCCT GCCCATCCTG GAGAGTTCCC TCCTGGCCCC 8100

ACAACCCAGG AGTCCACCCC TGACATCCCC CTCCTCAGCA TCAATGTGGG GATCCCAGAG 8160

CCTGAGGCCA CAGTCCCAAG GCCCATCCTC CTGCCAGCCT GGAAGAACTG GGCCCCAGAG 8220

TGAGGACAGA CTTGCAGGTC AGGGGTCCCG GAGGGCTTCA GCCAGAGTGA GAACAGTGAA 8280

GAGAAACAGC CCTGTTCCTC TCCCCTCCTT AGAGGGGAGC AGGGCTTCAC TGGCTCTGCC 8340

CTTTCTTCTC CAGTGCCCCC CATGGTGAAT GTCACCCGCA GCGAGGCCTC AGAGGGCAAC 8400

ATCACCGTGA CATGCAGGGC TTCCAGCTTC TATCCCCGGA ATATCACACT GACCTGGCGT 8460

CAGGATGGGG TATCTTTGAG CCACGACACC CAGCAGTGGG GGGATGTCCT GCCTGATGGG 8520

AATGGAACCT ACCAGACCTG GGTGGCCACC AGGATTTGCC AAGGAGAGGA GCAGAGGTTC 8580

ACCTGCTACA TGGAACACAG CGGGAATCAC AGCACTCACC CTGTGCCCTC TGGTGAGCCT 8640

AGGGTGACCC TGGAGAGGGT CAGGCCAGGG TAGGGACAGC AGGGATGGCT GTGGCTCTCT 8700

GCCCAGTGTA TAACAAGTCC CTTTTTTTCA GGGAAAGTGC TGGTGCTTCA GAGTCATTGG 8760

CAGACATTCC ATGTTTCTGC TGTTGCTGCT GCTGCTGCTG CTATTTTTGT TATTATTATT 8820

TTCTATGTCC GTTGTTGTAA GAAGAAAACA TCAGCTGCAG AGGGTCCAGG TGAGAAAAGC 8880

GGGCAGTTTC TGGAGATGGT AAGGCCCCTG TCTGGGCAGT AGGGTCCCCT CATTGCTCCT 8940

GCAAAGATAG GCATGTTGGT GACAAGGCTT CTGTAACAGG GGATGAAAGT TGGGGAATTT 9000

GGGAAGGGAA TGGGGGCAGC ATCTCCATCT ACACCCATAA GTGCTGCCCA AGCGAGGGTC 9060

AAACGCCCAG CTGTGGCATC TTCCTGCTGC AGGTGAGGAG TGGGCAGCAG GGAGGGCTGC 9120

GGCGCCTGCT CTGTCCCCAT CCCGGTCTCT GTGTCTCTTG GACTCACTAG GGCGCATCCA 9180

GGTGGGGTGA GCTGGGAATC ACGTGCTGAA TGCTGAGGGC CTGGATGATC ACGGCCTCAG 9240

AGGGAGCAAA TAGTAAAGGC AGCTGTGATC TGGGGAGGGC CAGAAACTGG AGAGGAATCT 9300

GAGGAGAGGC GGTGCCCCTA TTCCCTTCCT CTCTGCATCC CCCTCCCCTG TTTCTCCAGC 9360

CATCGGGGCG GACACCGAGA AAAAGACCTA TGAGGCCCAG CCTGGGGGCC CTGCCTGTGT 9420

AGCCCTTTGG AGACCCCTAG TAACAGGGAG GGTCCTGAGC ACACATGGCC ATCTCTGTCC 9480

ACTGTGCAGC TCCCCATGCA CCTCCTCCAG GAGCTTTCTT GGGGTTGTCG TGTCCTCTGC 9540

ACCATTCGAG GCCCTACTCT TTCCAGGTTC CCACGGCCTG GCCTCCCTGA GTTTCTTGCA 9600

GATGACATGG ATGAGTAGAT AAGCAGATGT CCCTGGGCCA TTTGAGGAGT GGGGCCCAGC 9660

CCCTCATCAG GGCAGCTGTG GTCCCTGTTT TCATCCTACC TCCGAGTGTT TTCTTCTCCA 9720

GTCCCTGAGG GACACAGTCC TCAGGGCCCA TGTTTTTGGG GATTTAATCT GTGCTCTGTG 9780

GCCTCACCTT GCCTTCCCTG AGCCAATTTC CCTTTCTAAA GGTGGTCACT GCCTGGTAAG 9840

TTTGGAGTAA GGGACGGTCA GAATCATTTC CCCTACAGTC AGGTTGTTTG ATGGGGGATG 9900

AAAAGAGACA GCAGGAAGTT TTGTGTTTCT GCAAAGACAG AAGCAGTTCA GGCGACAGTA 9960

AGAGGCTGGG GTGTCCAGGA GGGTGTGTCT GGCAGTAGGG TCGCTGGTTT CTCATCCTTG 10020

AACCTAATTG CACTGTCAGT CGGCCCCTCA GGCCTGAGCA GATGGGAAGG TTTGTCCCCT 10080

GCCCTGCAGC AAGAGGGCCC TGTCCAGGAG GCACCCACAA CAGAGGCAGT GCAGGTCTGT 10140

GGTCACTCCT ACTCTCACCT GTGGCGTCTC CCGTAGAGGG ATTGTCAGTT CTGGTTCCCT 10200

GTGGGCAGGA ATGGTTTCCT CATAGGTCAC TGGAGTTTTG GCCAGGAAAA GAGTATGAAG 10260

TTCATGTGGC AGTTTCTCAA AATTCCTGCT TTCAATGTTG ATGTCCAGTA AAGATATTCG 10320

TAATTTCAGC TCTATAATCT TAATAGGATT TCCTCTAATA TTGTGAAGCA TATTATATGA 10380

AACAGGAACA CAAATTTCTC AAAATTCCTG CGATGTCCAA TAAAGATTTT CATAATTTCA 10440

GCTCTGCAAT CTTAATAGGA TTTCCTAATA CTGTAAAGCA TATTAAATGA AACAGGAACT 10500

CAAATTTGGA GCCCCCTCTC CAGGAGGTTC TGTGTGGAGA TGGTGGCTGT GGCAGTGGCA 10560

GTTCCCAGGT GCAGAGGGTG GGCAGAGGCA GCCTCAGGCT AAGGGGTCTC CCCTACTCCA 10620

CATGGAGAAA ATCCCTTGTA GGTTGCAAGG GCAGTGGCCG GGTGGAATCC CTGCTAGGGA 10680



CAGAGCAGGA AGGCCTCGCA GCCTCACCAA GCAGCAGCCC TGGGGTGGAG CTGCGTTTCC 10740

AGGGTTAAGC GGACCAGGCA GGAGTAGCGG TTACTCAAGA GCAGGTGACA GGCTTGGGTT 10800

GTGAGGGTCA GGAGAGGCCA GGCCTCCTCG AGCAAGGTGG GGGTCCCAGG GTCAGGTCAG 10860

GTGCAGATCC TGTGGCAGCC ACGTCTTTCC ATGCTGGGCC TGCTGGGCCC CCCAGGCTTC 10920

CTGATGGGGT CCCCAGTTAG GAGCTGCCTG CTCAGGGCTG GGAGGGGAGG AGCACTGAGC 10980

TGCAGATAGA GGGCAGAGCC CACAGTGGGC AGGGCCTGCC CTGGTGTGTA GGTGCCTCTG 11040

CAGGAGAGGA GGGCCTGGGG ACTGAGAGCA AGGGTCAGGG CCTCTCTTTG GGGAGGCCTC 11100

TCACTGTAAC AGGACTGGTC AGGCCTGAGA GGAGGGCACT GGGTTCCCTC TTGGGTCTTG 11160

TCCTTTAGTC TTGGGGCCCT TTCCCTCCCT GCACGATGAG TGGTGGGCAC AGGGCACGGG 11220

CTGATGTTGA TGGAGTGATG GGAGGGAACT GGCAGGGGCT GGGAAAAGCA AGGAGGGAGG 11280

AAGAAAAAAG TGGGGGCCTC ATCTTCCCTC AGAGAAAGGG CAAATCTGGT TTTGGAGCAA 11340

CTGAAGAGAG AAAAGTCCCC AGGGAATAAA CACAACACTG CACCCAGTGG AGCATTTACC 11400

CATTTCCCTC TTTTCTCCAG AGCTCGTGAG CCTGCAGGTC CTGGATCAAC ACCCAGTTGG 11460

GACGAGTGAC CACAGGGATG CCACACAGCT CGGATTTCAG CCTCTGATGT CAGCTCTTGG 11520

GTCCACTGGC TCCACTGAGG GCGCCTAGAC TCTACAGCCA GGCGGCTGGA ATTGAATTCC 11580

CTGCCTGGAT CTCACAAGCA CTTTCCCTCT TGGTGCCTCA GTTTCCTGAC CTATGAAACA 11640

GAGAAAATAA AAGCACTTAT TTATTGTTGT TGGAGGCTGC AAAATGTTAG TAGATATGAG 11700

GCATTTGCAG CTGTGCCATA TT 11722

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 385 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Leu Gly Pro Val Phe Leu Leu Leu Ala Gly Ile Phe Pro Phe
1 5 10 15

Ala Pro Pro Gly Ala Ala Ala Glu Pro His Ser Leu Arg Tyr Asn Leu
20 25 30

Thr Val Leu Ser Trp Asp Gly Ser Val Gln Ser Gly Phe Leu Ala Glu
35 40 45

Val His Leu Asp Gly Gln Pro Phe Leu Arg Tyr Asp Arg Gln Lys Cys
50 55 60

Arg Ala Lys Pro Gln Gly Gln Trp Ala Glu Asp Val Leu Gly Asn Lys
65 70 75 80

Thr Trp Asp Arg Glu Thr Arg Asp Leu Thr Gly Asn Gly Lys Asp Leu
85 90 95

Arg Met Thr Leu Ala His Ile Lys Asp Gln Lys Glu Gly Leu His Ser
100 105 110

Leu Gln Glu Ile Arg Val Cys Glu Ile His Glu Asp Asn Ser Thr Arg
115 120 125

Ser Ser Gln His Phe Tyr Tyr Asp Gly Glu Leu Phe Leu Ser Gln Asn
130 135 140

Val Glu Thr Glu Glu Trp Thr Val Pro Gln Ser Ser Arg Ala Gln Thr
145 150 155 160

Leu Ala Met Asn Val Arg Asn Phe Leu Lys Glu Asp Ala Met Lys Thr
165 170 175

Lys Thr His Tyr His Ala Met His Ala Asp Cys Leu Gln Glu Leu Arg
180 185 190

Arg Tyr Leu Glu Ser Ser Val Val Leu Arg Arg Arg Val Pro Pro Met
195 200 205

Val Asn Val Thr Arg Ser Glu Ala Ser Glu Gly Asn Ile Thr Val Thr
210 215 220

Cys Arg Ala Ser Ser Phe Tyr Pro Arg Asn Ile Thr Leu Thr Trp Arg
225 230 235 240

Gln Asp Gly Val Ser Leu Ser His Asp Thr Gln Gln Trp Gly Asp Val
245 250 255

Leu Pro Asp Gly Asn Gly Thr Tyr Gln Thr Trp Val Ala Thr Arg Ile
260 265 270

Cys Gln Gly Glu Glu Gln Arg Phe Thr Cys Tyr Met Glu His Ser Gly
275 280 285

Asn His Ser Thr His Pro Val Pro Ser Gly Lys Val Leu Val Leu Gln
290 295 300

Ser His Trp Gln Thr Phe His Val Ser Ala Val Ala Ala Ala Ala Ala
305 310 315 320

Ala Ile Phe Val Ile Ile Ile Phe Tyr Val Arg Cys Cys Lys Lys Lys
325 330 335

Thr Ser Ala Ala Glu Gly Pro Glu Leu Val Ser Leu Gln Val Leu Asp
340 345 350

Gln His Pro Val Gly Thr Ser Asp His Arg Asp Ala Thr Gln Leu Gly
355 360 365

Phe Gln Pro Leu Met Ser Ala Leu Gly Ser Thr Gly Ser Thr Glu Gly
370 375 380

Ala
385



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGGCCATGGG GCTGGGCCGG GTCCTGCTGT TTCTGGCCGT CGCCTTCCCT TTTGCACCCC 60

CGGCAGCCGC CGCTGAGCCC CACAGTCTTC GTTACAACCT CATGGTGCTG TCCCAGGATG 120

AATCTGTGCA GTCAGGGTTT CTCGCTGAGG GACATCTGGA TGGTCAGCCC TTCCTGCGCT 180

ATGACAGGCA GAAACGCAGG GCAAAGCCCC AGGGACAGTG GGCAGAAGAT GTCCTGGGAG 240

CTAAGACCTG GGACACAGAG ACCGAGGACT TGACAGAGAA TGGGCAAGAC CTCAGGAGGA 300

CCCTGACTCA TATCAAGGAC CAGAAAGGAG GCTTGCATTC CCTCCAGGAG ATTAGGGTCT 360

GTGAGATCCA TGAAGACAGC AGCACCAGGG GCTCCCGGCA TTTCTACTAC GATGGGGAGC 420

TCTTCCTCTC CCAAAACCTG GAGACTCAAG AATCGACAGT GCCCCAGTCC TCCAGAGCTC 480

AGACCTTGGC TATGAACGTC ACAAATTTCT GGAAGGAAGA TGCCATGAAG ACCAAGACAC 540

ACTATCGCGC TATGCAGGCA GACTGCCTGC AGAAACTACA GCGATATCTG AAATCCGGGG 600

TGGCCATCAG GAGAACAGTG CCCCCCATGG TGAATGTCAC CTGCAGCGAG GTCTCAGAGG 660

GCAACATCAC CGTGACATGC AGGGCTTCCA GCTTCTATCC CCGGAATATC ACACTGACCT 720

GGCGTCAGGA TGGGGTATCT TTGAGCCACA ACACCCAGCA GTGGGGGGAT GTCCTGCCTG 780

ATGGGAATGG AACCTACCAG ACCTGGGTGG CCACCAGGAT TCGCCAAGGA GAGGAGCAGA 840

GGTTCACCTG CTACATGGAA CACAGCGGGA ATCACGGCAC TCACCCTGTG CCCTCTGGGA 900

AGGTGCTGGT GCTTCAGAGT CAACGGACAG ACTTTCCATA TGTTTCTGCT GCTATGCCAT 960

GTTTTGTTAT TATTATTATT CTCTGTGTCC CTTGTTGCAA GAAGAAAACA TCAGCGGCAG 1020

AGGGTCCAGA GCTTGTGAGC CTGCAGGTCC TGGATCAACA CCCAGTTGGG ACAGGAGACC 1080

ACAGGGATGC AGCACAGCTG GGATTTCAGC CTCTGATGTC AGCTACTGGG TCCACTGGTT 1140

CCACTGAGGG CGCCTAGACT CTACAGCCAG GCGGCCAGGA TTCAACTCCC TGCCTGGATC 1200

TCACCAGCAC TTTCCCTCTG TTTCCTGACC TATGAAACAG AAAATAACAT CACTTATTTA 1260

TTGTTGTTGG ATGCTGCAAA GTGTTAGTAG GTATGAGGTG TTTGCTGCTC TGCCACGTAG 1320

AGAGCCAGCA AAGGGATCAT GACCAACTCA ACATTCCATT GGAGGCTATA TGATCAAACA 1380

GCAAATTGTT TATCATGAAT GCAGGATGTG GGCAAACTCA CGACTGCTCC TGCCAACAGA 1440

AGGTTTGCTG AGGGCATTCA CTCCATGGTG CTCATTGGAG TTATCTACTG GGTCATCTAG 1500

AGCCTATTGT TTGAGGAATG CAGTCTTACA AGCCTACTCT GGACCCAGCA GCTGACTCCT 1560

TCTTCCACCC CTCTTCTTGC TATCTCCTAT ACCAATAAAT ACGAAGGGCT GTGGAAGATC 1620

AGAGCCCTTG TTCACGAGAA GCAAGAAGCC CCCTGACCCC TTGTTCCAAA TATACTCTTT 1680

TGTCTTTCTC TTTATTCCCA CGTTCGCCCT TTGTTCAGTC CAATACAGGG TTGTGGGGCC 1740

CTTAACAGTG CCATATTAAT TGGTATCATT ATTTCTGTTG TTTTTGTTTT TGTTTTTGTT 1800

TTTGTTTTTG AGACAGAGTC TCACTCGTCA CCCAGGCTGC AGTTCACTGG TGTGATCTCA 1860

GCTCACTGCA ACCTCTGCCT CCCAGGTTCA AGCACTTCTC GTACCTCAGA CTCCCGATAG 1920

CTGGGATTAC AGACAGGCAC CACCACACCC AGCTAATTTT TGTATTTTTT GTAGAGACGG 1980

GGTTTCGCCA AGTTGACCAG CCCAGTTTCA AACTCCTGAC CTCAGGTGAT CTGCCTGCCT 2040

TGGCATCCCA AAGTGCTGGG ATTACAAGAA TGAGCCACCG TGCCTGGCCT ATTTTATTAT 2100

ATTGTAATAT ATTTTATTAT ATTAGCCACC ATGCCTGTCC TATTTTCTTA TGTTTTAATA 2160

TATTTTAATA TATTACATGT GCAGTAATTA GATTATCATG GGTGAACTTT ATGAGTGAGT 2220

ATCTTGGTGA TGACTCCTCC TGACCAGCCC AGGACCAGCT TTCTTGTCAC CTTGAGGTCC 2280

CCTCGCCCCG TCACACCGTT ATCGATTACT CTGTGTCTAC TATTATGTGT GCATAATTTA 2340

TACCGTAAAT GTTTACTCTT TAAATAAAAA AAAAAAAAAA 2380

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 383 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Gly Leu Gly Arg Val Leu Leu Phe Leu Ala Val Ala Phe Pro Phe
1 5 10 15

Ala Pro Pro Ala Ala Ala Ala Glu Pro His Ser Leu Arg Tyr Asn Leu
20 25 30

Met Val Leu Ser Gln Asp Glu Ser Val Gln Ser Gly Phe Leu Ala Glu
35 40 45

Gly His Leu Asp Gly Gln Pro Phe Leu Arg Tyr Asp Arg Gln Lys Arg
50 55 60

Arg Ala Lys Pro Gln Gly Gln Trp Ala Glu Asp Val Leu Gly Ala Lys
65 70 75 80

Thr Trp Asp Thr Glu Thr Glu Asp Leu Thr Glu Asn Gly Gln Asp Leu
85 90 95

Arg Arg Thr Leu Thr His Ile Lys Asp Gln Lys Gly Gly Leu His Ser
100 105 110

Leu Gln Glu Ile Arg Val Cys Glu Ile His Glu Asp Ser Ser Thr Arg
115 120 125

Gly Ser Arg His Phe Tyr Tyr Asp Gly Glu Leu Phe Leu Ser Gln Asn
130 135 140

Leu Glu Thr Gln Glu Ser Thr Val Pro Gln Ser Ser Arg Ala Gln Thr
145 150 155 160

Leu Ala Met Asn Val Thr Asn Phe Trp Lys Glu Asp Ala Met Lys Thr
165 170 175

Lys Thr His Tyr Arg Ala Met Gln Ala Asp Cys Leu Gln Lys Leu Gln
180 185 190

Arg Tyr Leu Lys Ser Gly Val Ala Ile Arg Arg Thr Val Pro Pro Met
195 200 205

Val Asn Val Thr Cys Ser Glu Val Ser Glu Gly Asn Ile Thr Val Thr
210 215 220

Cys Arg Ala Ser Ser Phe Tyr Pro Arg Asn Ile Thr Leu Thr Trp Arg
225 230 235 240

Gln Asp Gly Val Ser Leu Ser His Asn Thr Gln Gln Trp Gly Asp Val
245 250 255

Leu Pro Asp Gly Asn Gly Thr Tyr Gln Thr Trp Val Ala Thr Arg Ile
260 265 270

Arg Gln Gly Glu Glu Gln Arg Phe Thr Cys Tyr Met Glu His Ser Gly
275 280 285

Asn His Gly Thr His Pro Val Pro Ser Gly Lys Val Leu Val Leu Gln
290 295 300

Ser Gln Arg Thr Asp Phe Pro Tyr Val Ser Ala Ala Met Pro Cys Phe
305 310 315 320

Val Ile Ile Ile Ile Leu Cys Val Pro Cys Cys Lys Lys Lys Thr Ser
325 330 335

Ala Ala Glu Gly Pro Glu Leu Val Ser Leu Gln Val Leu Asp Gln His
340 345 350

Pro Val Gly Thr Gly Asp His Arg Asp Ala Ala Gln Leu Gly Phe Gln
355 360 365

Pro Leu Met Ser Ala Thr Gly Ser Thr Gly Ser Thr Glu Gly Ala
370 375 380



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ACTGGGGAAC AAGGTTTATA TGAGA 25

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGTCACCCGT CTTCTACAGG ACCC 24 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGGGCCATGG GGCTGGG 17 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
ATCTGAGATG TCGGTCC 17

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CGTTCTTGTC CCTTTGCCCG TGTGC 25

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
AACCCTTCCC TTACCCCCGT CGTAG 25 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TATGTAAAAC GACGGCCAGT TTCACCTGTG ATTTCCTCTT CCCCA 45

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GGTCTTTTCA ATCCCCGTCT CTCGTCCAGT ATCGACAAAG GACAT 45

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TATGTAAAAC GACGGCCAGT TTCGGGAATG GAGAAGTCAC 40



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGAGAGGAGA GGGAGGTTAA CCAGTATCGA CAAAGGACAT 40

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TATGTAAAAC GACGGCCAGT GTTCCTCTCC CCTCCTTAGA 40

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

AAAAAGTCCC TTTCACGACC ACCAGTATCG ACAAAGGACA T 41